Parameter Setting for Evolutionary Latent Class Clustering

Authors

  • Damien Tessier
  • Marc Schoenauer
  • Christophe Biernacki
  • Gilles Celeux
  • Gérard Govaert
Abstract

The latent class model, or multivariate multinomial mixture, is a powerful model for clustering discrete data. This model is expected to be useful for representing non-homogeneous populations. It relies on a conditional independence assumption given the latent class to which a statistical unit belongs. However, it leads to a criterion that proves difficult to optimise with the standard approach based on the EM algorithm. An Evolutionary Algorithm is designed to tackle this discrete optimisation problem, and an extensive parameter study on a large artificial dataset allows stable parameter settings to be derived. Those parameters are then validated on other artificial datasets, as well as on some well-known real data: the Evolutionary Algorithm repeatedly performs better than other standard clustering techniques on the same data.
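
For readers unfamiliar with the model, the conditional independence assumption mentioned in the abstract can be written out as follows. This is a sketch in the usual textbook notation, which is assumed here and not taken from the paper: g classes with mixing proportions \pi_k, d categorical variables, variable j having m_j levels, x_i^{jh} = 1 if unit i shows level h of variable j (0 otherwise), and \alpha_k^{jh} the probability of that level within class k.

```latex
% Latent class (multivariate multinomial mixture) density, assumed notation:
% within a class, the d variables are independent multinomials.
p(\mathbf{x}_i;\theta)
  = \sum_{k=1}^{g} \pi_k
    \prod_{j=1}^{d} \prod_{h=1}^{m_j}
    \left(\alpha_k^{jh}\right)^{x_i^{jh}},
\qquad
\sum_{k=1}^{g} \pi_k = 1,
\quad
\sum_{h=1}^{m_j} \alpha_k^{jh} = 1 .
```

The inner double product is where conditional independence enters: given the class k, the joint probability of a unit's responses factorises over the variables.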

Similar resources

Clustering and combining pattern of metabolic syndrome components among Iranian population with latent class analysis

Background: Metabolic syndrome (MetS), a combination of coronary heart disease and diabetes mellitus risk factors, is one of the most challenging public health issues worldwide. The aim of this study was to identify the subgroups of participants in a study on the basis of MetS components. Methods: The cross-sectional study took place in the districts related to Teh...

Evolutionary User Clustering Based on Time-Aware Interest Changes in the Recommender System

The abundance of data on the Internet has created problems for users and caused confusion in finding the proper information. Also, users' tastes and preferences change over time. Recommender systems can help users find useful information. Because interests change, these systems must be able to evolve. In order to solve this problem, users are clustered that determine the most desirable users, it pa...

A partition-based algorithm for clustering large-scale software systems

Clustering techniques are used to extract the structure of software for understanding, maintaining, and refactoring. In the literature, most of the proposed approaches for software clustering are divided into hierarchical algorithms and search-based techniques. In the former, clustering is a process of merging (splitting) similar (non-similar) clusters. These techniques suffered from the drawba...

Analyzing Motorcycle Crash Pattern and Riders’ Fault Status at a National Level: A Case Study from Iran

Motorcycle crashes constitute a significant proportion of traffic accidents all over the world. The aim of this paper was to examine motorcycle crash patterns and rider fault status across the provinces of Iran. For this purpose, 6638 motorcycle crashes that occurred in Iran from 2009 to 2012 were used as the analysis data, and a two-step clustering approach was adopted as the analysis framework....

Latent Class Analysis of the cardiometabolic risk factors in children and adolescents: the CASPIAN-V study

Background: Cardio-metabolic syndrome indicates the clustering of several risk factors. The aims of this study were to identify the subgroups of Iranian children and adolescents on the basis of the components of the cardio-metabolic syndrome and to assess the role of demographic characteristics, socioeconomic status and lifestyle-related behaviors on the membership of participants in each lat...

Publication date: 2007